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ABSTRACT 


The two-parameter Weibull distribution has been recognized as a use- 
ful model for survival populations associated with reliability studies 
and life testing experiments. In the analysis of atmospheric data, the 
distributions encountered are often a result of combining two or more 
component distributions. These compound distribu cions are consequently 
of interest to aerospace scientists. Presented is a method for estima- 
tion of the parameters of a compound Weibull distribution with two shape 
parameters, two scale parameters and a proportionality factor. The most 
general case of estimation will be considered in addition to a number of 
special cases that may be of practical value. 
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TECHNICAL MEMORANDUM X-53422 


ESTIMATION OF PARAMETERS IN COMPOUND WEIBULL DISTRIBUTIONS 


SUMMARY 


The two-parameter Weibull distribution has been recognized as a use- 
ful model for survival populations associated with reliability studies 
and life testing experiments. In the analysis of atmospheric data, the 
distributions encountered are often a result of combining two or more 
component distributions. These compound distributions are consequently 
of interest to aerospace scientists. This paper presents a method for 
estimation of the parameters of a compound Weibull distribution with 
density function 

f(x) = QfxCx) + (1 - a) f 2fa) » 0 < a < 1 

where 


fi(x) * 7i e ^ 1 X 71 ’ 1 exp [-x 7l /0i], x 5 0, 0 X > 0, 7l > 0 

and 

f 2<x) * 72 X 72 1 exp [-X 72 /e 2 ], X s 0, 02 > 0, 7 2 > 0. 

The parameters required are a, the proportionality factor, 7^ 70, 0i 
and 0 2 . The most general case of estimation will be considered in addi- 
tion to a number of special cases that may be of practical value. 



I. INTRODUCTION 


The Weibuil distribution, derived in 1939 by W. Weibuil, has been 
recognized as an appropriate model in reliability studies and life test- 
ing. Numerous methods for obtaining efficient estimates of the two 
parameters of this distribution have been outlined in recent years [4, 
10 , 11 ]. 

In actual physical applications, however, a mixture of two Weibuil 
distributions often seems to be a more desirable model. Distributions 
resulting from mixing two or more component distributions are designated 
as "mixed” or "compound." This situation is quite common in the analysis 
of atmospheric data and consequently is of interest to aerospace scientists. 
Compound normal, Poisson and exponential distributions have been studied 
by A. C. Cohen, Jr. [1,2,3]. A method for estimating parameters of mixed 
distributions using sample moments has been outlined by Paul R. Rider [7] 
who considered compound Poisson, binomial, and a special case of the com- 
pound Weibuil distribution. A graphical procedure for estimation of 
mixed Weibuil parameters in life-testing of electron tubes is given by 
John H. K. Kao [6 ] . Although graphical methods have value for locating 
outliers, deriving initial estimates, and for determining whether the 
distribution is as hypothesized, for estimation purposes the analytic 
approach is probably superior. 

This paper represents an attempt at estimating, by the method of 
sample moments, the five parameters of the compound Weibuil distribution 
with density function 


f(x) = of i(x) + (1 - a) f 2 (x), 0 < a < 1 


( 1 ) 


where 


fi(x) = 7i0i lx71 1 ex P [“ x7l /0i3, 
fo(x) = 1 exp [-x 72 /9 2 ], 

C— 


x ^ 0, 0i > 0, 7i > 0 ^ 

). ( 2 ) 

x 5 0, 0 2 > o, 72 > 0> 


The parameters involved are two scale parameters 0! and 0 2 , two shape 
parameters y ± and y 2i and the proportionality parameter a which expresses 
the probability that a given observation x^ comes from the population fi. 


$ 
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The compound cumulative distribution function is defined 


F(x) * aFjXx) + (1 - a) F 2 (x) “1- a exp [-x^VGi] - (1 - a) exp [-x^ 2 /0 2 J. 


Figures 1 and 2 illustrate a generalized mixed Weibull probability 
density function and its corresponding distribution function. 

The most general case of estimation will be considered in which all 
five parameters must be estimated from the data. Also, a number of special 
cases will be investigated in which certain parameters are know in advance 
of sampling or are restricted in some manner. Included will be the special 
case where 7i * 1, i.e., 


fi(x) * 0” 1 exp C-x/Gx], (4) 

which is the well-known exponential distribution. Estimating procedures 
are greatly simplified in these special cases as there are fewer sample 
moments involved in the estimating equations. Also, sampling errors are 
reduced because of the elimination of the need for higher order moments. 


II. ESTIMATION IN THE GENERAL CASE 


The rth theoretical moment about the origin of f(x) is given by 



dx + (1 - a) 


00 


x r f 2 (x) dx, 


(5) 


where fi(x) and f 2 (x) are defined as in equations (2). The first five 
theoretical moments about the origin of (1) follow as 
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MIXED WEIBULL PROBABILITY DENSITY FUNCTION 

FIGURE I 





MIXED WEIBULL CUMULATIVE DISTRIBUTION FUNCTION 

FIGURE 2 
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( 6 ) 


where f is the gamma function, i.e., 


r(k) - j' y k 1 e“ y dy. 
o 

Employing the technique of equating population moments to correspond- 
ing sample moments, the set of equations (6) becomes 

•i * « e i /71 r (jl + 0 + (1 ' a) ®2 /r2 r (i + 0 

m k ■ a ®i /71 r + i) + (l - a) e| /7s r + *) 

S - a e 3 / 71 r (^ + i) + (l - oo e 3/7s r + 1 ) (7) 

®4 ■ « ef 71 r (^ + i^) + (i - a) e ^ 72 r + i) 

■ « e I /71 r (~ + 0 + a * <* 6 2 /7£r (i + 1 ) 
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5) is the ith noncentral moment of the 


where m^ (i ■ 1, 2, . .., 
sample. 


The set of equations (7) is .* system of five equations which must 
be solved simultaneously for estimates of the five parameters 9i* 
0 2 , 7 1 an< * 72 . convenience in handling these equations, we will 

make the following transformations where necessary in this paper. 
(This notation will be used unless stated otherwise.) 

Let 


v = e^ Pi =r (i +1 ) +i =r (i +1 ) 

n = 4 7z fa * r CrI + 0 * r (i + l ) 

P 3 =r (t +l ) * 3=r & +1 ) 


( 8 ) 


Thus, the first three equations of (7) become 


m^ * QVPx + (1 - a) n\Jr! 

(9) 

mV, = QV 2 p 2 + (1 - a) n 2 ^2 

(10) 

m' 3 = av 3 p 3 + (1 - a) n 3 \jr 3 . 

(11) 


Solving (9) for v, substituting this expression into (10), and then 
solving for n yields 

m^l-a'^i^ ± J * (m^) 2 a(l- lv) 8 2 Pi^ 2 + n»2Q:(l-Q:) 2 p2Piti + m^a 2 (l-a) (3*^2 

n - 11 - 


( l - a ) 2 ^£ 2 + ^( 1 - 00^2 


( 12 ) 



Substituting the expression for n from (12) back into (9) and solving 
for v gives 


OPl 


mJd-OWite * J ~ (m^) 2 a(l-a)p2^^ + n^aa-o^fepjilrf + m^Cl-a)^*., 


( 13 ) 


(l-a) 2 ^^ + a(l-a)^a 


Upon substituting the expression for n from (12) and the expression for 
v from (13) into equation (11), we have one equation in the three unknowns 
a, 7 i and y 2 , At this point, it is obvious that explicit expressions for 
the unknown parameters a, y± and y 2 cannot be obtained. Therefore, it 
is suggested that the following procedure be used to obtain a graphical 
estimate of Oi, the proportionality parameter. This method is essentially 
that of Kao [6] and is based upon the fact that a simple Weibull cumula- 
tive distribution becomes a straight line in In versus ^-ln coordinates. 
This method of estimating a will produce a relatively small error in 
the estimating procedure since a is limited to the range 0 < a < 1. 

(1) Plot the sample cumulative distribution function for the 
mixed data on special In versus ln-ln paper and visually 
fit a curve (called Weibull plot) among these points.* 


(2) Starting at each end of the Weibull plot, draw two tangent lines 

and denote them by^F^and (1 - a) F 2 which are estimates of 
aFi(x) and (1 - a)F 2 (x), respectively. 

(3) At the intersection of (1 - q:)F 2 with the upper borderline 

drop a vertical line whose intersection with CCF! as read from 
the percent scale gives our estimate of a . 

See Figure 4 under Section IV for an illustration of this method. 

Once an estimate of a has been determined graphically, solve equa- 
tion (11) for 7 i and y 2 by the following iterative procedure. This 
procedure is taken from Cohen [1] and is a modified Newton-Raphson 
method. 


Assume a value for y ± and solve equation (11) for a first approxi- 
mation to 7 2 . These first approximations can be substituted into (12) 
and (13) to obtain first approximations to 0 X and 0 2 . The first set of 
approximations are then introduced into the fourth equation of (6) to 
approximate the fourth non-central theoretical moment, p' . 


^Special Weibull graph paper is available from Cornell University, 
Ithaca, New York. 
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Let 7 -^) denote the ith approximation to and let denote 

the ith approximation (corresponding to 7 X ^\) to m 4 * It: should be 

relatively easy to find approximations 7 ^^ and such that the 

sample moment m^ is in the interval t M- 4 ( £ ) * ^(i+i)^* 0 nce the interval 

between 7 , . and 7 . has been narrowed sufficiently, the required 

' 1 ( 1 ) ' 1 ( 1 + 1 ) 

estimate y 1 can be obtained by a simple linear interpolation as indicated 
below. 


7 1 

t 

7 i(i) 

4(i> 

7 1 

1 

m 4 

7 i(i+i) 

1 

^ l 4(i+l) 


The required estimate of 72 can subsequently be obtained from equation 
(11). Once 7 X and 70 have been determined by equation (11), estimates 
for Oi and 0 ; j are obtained from equations (13) and (12), respectively. 

Unfortunately, the quadratic solutions in equations (12) and (13) 
result in more than one set of estimates. The problem of non-unique sets 
of estimates was considered by Karl Pearson [15] and A. C. Cohen, Jr. [1] 
in connection with mixtures of two normal distributions. Pearson sug- 
gested choosing the set of estimates which gives closest agreement between 
the sixth sample moment and the sixth theoretical moment after equating 
the first five sample moments to the corresponding theoretical moments. 
This procedure is followed for all acceptable sets of estimates. 

Cohen [1] suggests, as an alternate procedure for resolving the 
problem of multiple sets of estimates, that we might choose the set of 
estimates which produces the smallest Chi-Square Index of Dispersion when 
observed frequencies are compared with expected frequencies. 

In the general case of estimation considered here, we are concerned 
with a mixture of two Weibull distributions where the proportionality 
factor (7 is estimated graphically from the cumulative frequencies and the 
four remaining parameters are estimated by equating the first four sample 
moments to corresponding theoretical moments. When confronted with more 
than one set of acceptable estimates, we adopt Pearson's suggested pro- 
cedure and choose the set which produces the closest agreement between 


» 
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the fifth noncentral moment of the sample m^ and the theoretical 
moment ^5 given by the final equation of ( 6 ). 

The calculations described above may be carried out iteratively 
with relative ease using the computer program included in this paper as 
an Appendix. First approximations to initiate the iterative process 
may be obtained using a graphical method such as that of Kao [ 6 ]. 


III. ESTIMATION IN SPECIAL CASES 


A number of speci 1 cases that may be of practical value in which 
certain parameters are known or are restricted in some manner are con- 
sidered. 

1. 0! known. With 0 X known, we must estimate the parameters a, 
0 2 , y x and y 2 only. If we let v * 0 ~ equations (9), (10) and (11) 
become 


rn'i * a Pi + (1 - 

m^ = CC 0^^^ P2 (1 “ 

m^ 88 05 G ^^ 1 fe + (1 " 

Solving (14) for a gives 


aDn^!, 

(14) 

a)n 2 \Jf 2 , 

(15) 

a)n 3 \lf 3 . 

(16) 


m i ~ n ^i 

ei /ri Pi - nil 


(17) 


Inserting this expression for a into equation (15) and solving for n, 
we obtain after considerable algebraic manipulation 


v a *xte - ± + v 4 \|fpf - *2 + 4(m^) 2 v 2 p 2 i 2 + 4w^v 8 ^* 2 - (18) 

n - — » — — — — — • ■' — 

2 vP !* 2 - 2 m ^*2 
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Substituting this expression for n back into (17) gives 


«; - *1 


w 2 + 1 fe - ± '/(Og) 2 V* - 2B^V 2 \|r*ft> + v 4 **Pg - 4^V 3 0ife*2 + + Ab^V 2 ^* 2 - 4o^B^V0x*2 

2vpx* a - 2 b 


( 19 ) 


0x /7l Pi - *i 


V 2 *ife - B^*x ± •/(B^) 2 *® - 2“gV 2 ^fe + v 4 *^ - Am^v 3 PxBct8 + A(b^) 2 v 2 P b * 2 + Ab^v 2 P 2 + 2 - Aa^vp,*;, 

2 vPx* 2 - 2m*ife 


Now, upon substituting the expression for n from (18) and the expression 
for Oi irom (19) into equation (16), we have an equation in the two 
unknowns and y 2 which may be solved by the iterative procedure 
described in the general case. Once y± and y 2 have been determined, 
we obtain our estimates for 0 2 and OL from equations (18) and (19), 
respectively. As in the general case, the positive and negative roots 
resulting from the quadratic solution in (18) unfortunately results in 
more than one set of estimates. As before, it is suggested that the 
set of estimates which gives the closest agreement between the fifth 
noncentral moment of the sample and the corresponding "fitted" compound 
curve be used. 


2. 0 2 known. With 0 2 known we need only estimate a, 0^, 73. and y 2 . 

If we let n = 0p ^ 2 , equations (9), (10) and (11) become 


m^ - + (1 - Oi) 0 


i/y- 




( 20 ) 


ml. = av-po + (1 - a) 0^' \[r 2 . 


2 


m!, = av ’*p 3 + (1 - a) 0 


J / 7 c 


^3. 


( 21 ) 

(22) 


Solving (20) for a gives 


a = 


< - 0^ 

vPx - o 1 / 72 tl 


(23) 

% 


9 


10 


Inserting this expression for a into equation (21) and solving for v, we 
obtain after considerable algebraic manipulation 


■a Pi * n Z Pi*3 * Vn 4 p£i|£ - 2n a m^p£*z + (m^) 2 ^ - + 4(*J) 2 n 2 ft2\|rj> - 

» * — 1 • - ■ » ■ ■■■■ - i ' ■ - - ■■■ ■ - 

2ajfe - 


(24) 


Substituting this expression for v back into (23) gives 


*1 - & 


(25) 


+ (•!)*($ - + »cj)VMi» • 


* 2“*iea 


- 


Now, upon substituting the expression for v from (24) and the expression 
for CC from (25) into equation (22), we obtain an equation in the two 
unknowns y± an< * 72 which may he solved by the iterative process described 
previously. Once and y 2 are determined, we obtain our estimates for 
0! and a from equations (24) and (25). As in case 1, 0 X known, the posi- 
tive and negative roots resulting from the quadratic solution in (24) 
gives more than one set of estimates. Again, it is suggested that we 
choose the set of estimates which gives the closest agreement between 
the fifth noncentral moment of the sample and the corresponding "fitted" 
compound curve. 

3. known. If y± is known we must estimate a, 0!, 0 2 and / 2 
only. Solving equation (9) for v gives 


m^ - (1 - a) n\jrx 

— — 


(26) 


Inserting this expression for v into equation (10) and solving for n, 
we obtain 


m^l -a) tyxPs ± J - (m^) 2 a( 1 - a) + m^a^(l-a)pi\jr 2 

n m - ' - — ■ 

(l -a) 2 ylp 2 + a(i-a)pf\y 2 


(27) 

11 


Substituting this expression for n back into (26) gives 


Qpi 


mid-oOtxPa ± j- (m^) 2 0(l-a)p 2 ^a + *£ad-c0*fepj*f + m^U-a)^ 


( 28 ) 


(l-a) 2 tfpa + 0(1-0) pfte 


* 


Upon substituting the expression for n from (27) and the expression for 
v from (28) into equation (11), we obtain an equation in the two unknowns 
(X and y 2 which may be solved by the iterative procedure described in the 
general case. With a and y 2 determined we may solve equations (27) and 
(28) for Go and 0^ respectively. As before, the positive and negative 
roots which result from the quadratic solution in (27) give more than 
one set of estimates. Again, we choose the set of estimates which 
gives the closest agreement to the fifth noncentral moment of the 
sample. 

An alternate method for estimation in this case would be to estimate 
a graphically as in the general case and then solve equation (11) for y 2 
after the substitution of the expression for n from (27) and the expres- 
sion for v from (28) into equation (11). As before, 0 2 and 0 X would 
then be obtained from equations (27) and (28). 

4. y 2 known. With y 2 known, we must estimate a, 0^ 0 2 and 
only. As in case 3, y ± known, solving equation (9) for v gives 


m* - (1 - a) mjf! 

______ 


(29) 


Inserting this expression for v into equation (10) and solving for n, 
we obtain 


m 


ia-aHjJb ± 7 - (m’^aO-cOpopf.fe + m^a(l-a) 2 p 2 p|+f + 


n = 


(l-a) 2 i|ff|3 2 + aa-oOpffe 


( 30 ) 
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L 2 



When we substitute this expression for n back into (29), we get 






( 31 ) 


(l-a) 2 ^fe + ct(l-a)p^2 


Upon inserting the expression for n from (30) and the expression for v 
from (31) into equation (11), we obtain an equation in the two unknowns 
Ct and 7 x which may be solved by the iterative procedure described pre- 
viously. With a and y ^ determined in this manner, we now solve equa- 
tions (30) and (31) for 0 2 and 0 1# respectively. As before, we choose 
the set of estimates which gives the closest agreement between the fifth 
noncentral moment of the sample and the corresponding "fitted" compound 
curve. 


As in case 3, y x known, an alternate method for estimation 
would be to estimate a, the proportionality parameter graphically, and 
then solve equation ( 11 ) directly for y± after the substitution of the 
expression for n from (30) and the expression for v from (31) into 
equation ( 11 ). 

5. 71 * 1. This is a special case of case 3, y 1 known. Thus, 

case 3 is reduced to mixing an exponential distribution with a Weibull 
distribution where fj.(x) in equations ( 2 ) is an exponential distribution 
and f 2 OO is a Weibull distribution. We need only estimate a, 72 , 61 
and 02 * 


With 7 i * 1, equations (9), (10) and (11) become 

(32) 

(33) 

(34) 

where the only change in notation from previous cases is v * 0 !. 


m^ * av + (1 - a) mjf! 
m^ * 2 av 2 + (1 - a) n 2 -\|f 2 
m^ = 6 ov 3 + (1 - Ct) n 3 ijr 3 , 
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Solving equation (32) for v gives 


v 


- (1 - Oi) n\|ri 

a ~~~ 


( 35 ) 


Substituting this expression for v into equation (33) and solving for 
n gives 


2m , 1 (l-a)i(f 1 ± N^m^l-a) 2 ^! - 2(m , i ) 2 a(l-Qj)\lr 2 + m^a^l-a)^ 


n = 


(36) 


2(l-a) 2 \| / 2 + a(l-a)ty 2 


Inserting this expression for n back into (35) gives 


v = 


m! 


a 




2m , 1 (l-a)^ 1 ± J 2m l 2 a(l - 2(m' 1 ) 2 a(l-a)\Jr 2 + 


2a(l-cu)\)/ 2 + 


(37) 


Now, upon substituting the expression for n from (36) and the expres- 
sion for v from (37) into equation (34), we obtain an equation in the 
two unknowns a and y 2 which we may solve using the iterative procedure 
described in previous cases. With minor simplifications, equation (34) 
becomes 


m 


m ' 


6Q 


“ “ 'h 


^(l-a)^ ± N^aa-a) 2 ^ - 2(m^) 2 a(l-a)t 2 + m^a 2 (l-aH 2 

2a(l-a)\|r* + a 2 ^ 


( 38 ) 


(l-a)Xr 3 


2ra^(l-a)\lr 1 ± s/2m^a(l-a) 2 ^ - 2(m^) 2 a(l-a)t2 + m^0 2 (l-a)\lf 2 

2(l-a) 2 ^ + a(l-a)\|r 2 
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Once a and y z have been determined from equation (38) we may obtain 
our estimates for 0 2 and 0^ from equations (36) and (37), respectively. 

As in the other cases, we choose the set of estimates which 
gives the closest agreement between the fifth noncentral moment of the 
sample and the corresponding "fitted" compound curve. 

An alternate method of estimation would be to estimate a 
graphically as in the general case and then solve equation (38) directly 
for 7 2 » As above, estimates for 0 2 and 0 X would then be obtained from 
equations (36) and (37). 

6. 7i * 72 “ unknown. Changing our notation slightly, we will let 
7 1 * 72 * 7. Thus, we must estimate a, 7, 0 X and 0 2 . Now, let 



With this notation, equations (9), (10) and (11) become 


m^ * avp! + (1 - a) np! 

(39) 

m^ * av 2 p 2 + (1 - a) n 2 (3 2 

(40) 

m^ * av 3 {3 3 + (1 - a) n 3 f3 3 . 

(41) 


As before, solving equation (39) for v gives 


m^ - (1 - a) np x 
OPi 


(42) 
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Inserting this expression for v into equation (40) and subsequently 
solving for n, we have 


m'jCl-cOfe ± v ( l-a)02 Oi[m' 2 p>l - (m^) 2 £ 2 ] 

n = 

(l-a)Pip 2 


(43) 


Substituting this expression for n back into (42) gives 


m,' m^(l-a)pp ± Ja( l-GOoCm^ - (m^) 2 p 2 ] | 

af3 l - Of Pi p 2 J 


(44) 


Upon substituting the expression for n from (43) and the expression for 
v from (44) into equation (41), we obtain an equation in the two unknowns 
a and y which may be solved by the iterative process described in the 
general case. We may now solve equations (43) and (44) for 0 2 and 0 1# 

As in the other cases, the quadratic solution in equation (43) results 
in more than one set of estimates. Again, we use the set of estimates 
which gives the closest agreement to the fifth noncentral moment of the 
sample. 

As before, an alternate solution would be to estimate CL graph- 
ically and then solve the resulting equation (41) directly for y. 

In the event that cl is known in advance of sampling we may 
solve equation (41) directly for y and subsequently obtain 0 2 and 0! 
from (43) and (44). 

7. 7x ~ 7a ~ known. If we let y x * y 2 * y, the first three equa- 
tions of (7) become 
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Thus, we must estimate ct, 0* and 0 2 only. For simplification, we will 
let 



Now, equations (45), (46) and (47) become 


a0^ 7 + (1 - a) 

9 2 /7 “ C 1 

(^8) 

c#“ /7 + (l - a) 

4 7 ' '2 

(49) 

cb^ 7 + (1 - a) 

e 3 / 7 - c 3 . 

(50) 


Solving equation (48) for a we have 



ct 


6 


1/7 . e i h 


(51) 


Substituting the expression for a from (51) into (49) and (50), we have 
after considerable algebraic manipulation the equations 


e 1 / 7 <4 /7 - * < h ) - e, (52) 

ei /y tg* (e 1 / 7 + e^ 7 ) - Cl (ef 7 + e 1 / 7 ^ + e| /7 ) - c 3 . (53) 

Inserting the expression for 6^^ 0*^ from (52) into the left side of 
equation (53), we obtain after simplifying 

O^ 7 02 /r c ! - c 2 (e^ 7 0* /7 ) + C 3 - 0. (54) 
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Solving (54) for 0^ 7 gives 


e 


Jy _ C 2 “ c 3 

^J 7 Cl - c 2 


(55) 


Substituting this expression for 0^ 7 back into equation (52), we have 
after simplification the quadratic equation in 0^'? 


(co - c 2 ) [0^ 7 ] 2 - (c 3 - c-,c 2 ) 6^ 7 + (c ± c 3 - c 2 ) * 0, 


(56) 


whose solution for 0^ is 


0. = 
l 


2 (cl - Co) 


-l 


“17 


(-c 3 + c jC 2 ) ± (c 2 - 6c 1 c 2 c 3 - 3c 2 c 2 + 4c^c 3 + 4 c 


(57) 


Without loss of generality, we may impose the restriction that 0! < 0 2 . 
Thus, we obtain 0! and 0 2 from equation (57) using the negative and 
positive roots, respectively. Once we have determined 0! and 0 2 , we 
obtain our estimate for Ct from equation (51). 


8. 0i = 0 2 = known. If we let 0^ = 0 2 = 0, the first three equa- 
tions of (7) become 


m^ = tfG 1 ^ 71 Pi + (1 - a) Q 1 ^ 2 \Jr2. 


mV = ao 2 ^ 71 p 2 + (l - a) e 2 ^ 72 \Jr 2 


mV = ao 71 p 3 + (1 - a) 0 "' \jr 3 


/ 7 


3/72 


(58) 

(59) 

(60) 


y 

* 
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Thus, we must estimate the parameters a, y± and y 2 only. As previously, 
solving (58) for a gives us 


m' x - 9 1/ ^ +1 

7^1 - e 1 ^ ’ 


(61) 


Substituting this expression for a into equation (59), we have after 
simplifying 


mi (e 2/7l fe - e^a) + rn- (e l/ ^ H - e 1 ^) 


- e 2/7l 3 2 e l/72 ti + e 2/72 t 2 e l/7l Pi * o. 


(62) 


Equation (62) is an equation in the two unknowns y± and y 2 which may 
be solved by the iterative procedure described for the general case of 
estimation. Once y± and y 2 are determined, we obtain our estimate for 
a from equation (61). 

9. 9x * 02 ® unknown. For this special case, it is suggested that 
we solve for OC graphically and subsequently follow the procedure out- 
lined for the general case of estimation. 


IV. AN ILLUSTRATIVE EXAMPLE 


To illustrate the estimation procedure outlined in this paper for 
the general case, we will consider a sample of 2000 observations selected 
from a mixed population constructed by combining two Weibull distributions 
with 7i = 2.0000, 0 X = 10.0000, y 2 = 0.8000, 0 2 = 1.0000 and a = ( 0.8000. 

The sample is summarized in Table I. For the sample selected, = 2.4708, 
m^ = 8.6270, m' 3 = 36.3408, m^ = 174.9190 and m^ = 935.3733. 
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TABLE I 


A SAMPLE OF 2000 OBSERVATIONS FROM A MIXED WEIBULL POPULATION 


CLASSES 

CLASS MARKS 

h 


f 

CUMULATIVE FREQUENCY 
IN PER CENT 1 

0 - 0.5 

0.25 

40 

175 

215 

10.75 

0.5 - 1.0 

0.75 

113 

78 

191 

20.30 

1.0 - 1.5 

1.25 

170 

47~ 

217 

31.15 

1.5 - 2.0 

1.75 

205 

30 

235 

42.90 

2.0 - 2.5 

2.25 

216 

20 

236 

54.70 

2.5 - 3.0 

2.75 

206 

14 

220 

65.70 

3.0 - 3.5 

3.25 

180 

10 

190 

75.20 

3.5 - 4.0 

3.75 

147 

7 

154 

82.90 

4.0 - 4.5 

4.25 

112 

5 

117 

38.75 

4.5 - 5.0 

4.75 

80 

4 

84 

92.95 

5.0 - 5.5 

5.25 

54 

3 

57 

<>5. 80 

5.5 - 6.0 

5.75 

34 

2 

36 

97.60 

6.0 - 6.5 

6.25 

20 

1 

21 

98.65 

6.5 - 7.0 

6.75 

11 

1 

12 

99.25 

7.0 - 7.5 

7.25 

6 

1 

7 

99.60 

7.5 - 8.0 

7.75 


1 

4 

99.80 

8.0 - 8.5 

8.25 

1 

1 

2 

99.90 

8.5 - 9.0 

8.75 

1 


1 

99.95 

9.0 - 9.5 

9.25 

1 


1 

100.00 



1600 

400 

2000 



In the above table, f x = class frequencies from f 1 (x), f 2 = class fre- 
quencies from f ;? (x), and f = class frequencies from the resulting mixed 
distribution. 

Figure 3 is a graph of the compound density function and its compo- 
nent distributions. Notice at this point that y ^ 1 produces a J-shaped 
function while y > 1 produces a bell-shaped curve. 

Employing the graphical technique described in Section I provides an 
estimate of a, the proportionality parameter, equal to 0.80 as shown on 
Figure 4. 


.!() 









Once our estimate of a has been determined graphically, we solve 
equation ( 11 ) iteratively for first approximations to y 1 and y 2 . Corre- 
sponding first approximations for 0 ! and 0 2 are obtained from equations 
( 13 ) and ( 12 ), respectively. Each set of first approximations is intro- 
duced into the fourth equation of (6) to approximate the fourth noncentral 
theoretical moment, p^. Each set of first approximations is also sub- 
stituted into the final equation of (6) to approximate the fifth non- 
central theoretical moment p^, as suggested by Pearson [15]. The set of 
estimates which gives the closest agreement between the fifth noncentral 
moment of the sample m^ and the corresponding "fitted" compound curve 
given by the final equation of (6) is the required set of estimates. 
Utilizing the computer program given in the Appendix, we find that our 
estimate of y x lies between 1.90 and 2.10. The corresponding estimates 
for the remaining parameters, pi and p f 5 , are as follows: 


7 1 

72 

6i 

02 

• 

P-4 

i 

M-5 

1.90 

2.30 

11.52 

1.29 

343.8875 

1926.6399 

2.10 

0.80 

6.74 

0.82 

64.6006 

287.5141 


Our next approximation to y x *- s obtained by simple linear interpolation 
as indicated below. 


7 1 

t 

1.90 

343.8875 

2.0210 

174.9190 

2.10 

64.6006 


Substituting y-± ■ 2.0210 into equation (11), solving for y 2 and subse- 
quently solving equations (13) and (12) for 0 X and 0 2 , we obtain 
72 * 0 . 8339 , 0 X = 10.1496 and 0 2 = 0.9929. Introducing this set of 
approximations into the final equation of (6) gives p ' 5 = 935.3646. This 
value of ps is in such close agreement with the corresponding sample 
moment m' 5 = 935.3733 that we are justified in accepting this set of 
approximations as our final estimates. However, if further preciseness 
is desired, this iterative process may be continued to any desired degree 
of accuracy. 
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FIGURE 







The computer program outlined in the Appendix gives all possible 
solutions to the estimating equations. Specifically, equations (12) 

and (13) produce four solutions resulting from the four combinations of | 

the positive and negative signs prefixing the radicals; i.e., the com- 
binations are (-,-), (-,+), (+,-) and (+,+). It was discovered that 
the computer program printout gave closest agreement between the theo- 
retical moment [i 5 and the sample moment m^ when the combination (-,-) 
was used. 

Comparisons of expected with observed frequencies, along with a com- 
parison of observed and expected distribution functions, are presented in 
Table II. It also seems appropriate to compare the observed frequencies 
for the mixed sample with these same frequencies assuming that the sample 
fits a simple Weibull distribution. This will prove or disprove that our 
mixed sample could be treated us a simple Weibull distribution. Cohen's 
maximum likelihood estimation procedure [4], was used to derive estimates 
for the parameters, and the resulting expected frequencies were obtained. 

Notations used in Table II are as follows: 

f 0 = observed frequencies for mixed data. 

F 0 = observed distribution function for mixed data. 

f e = expected frequencies using estimates derived in this 
report. 

F e = expected distribution function using estimates derived 
in this report. 

« 

f es = expected frequencies assuming data fits a simple Weibull 
dis tribution. 

F eg = expected distribution function assuming data fits a simple 
Weibull distribution. 

The agreement between observed frequencies for the sample and 
expected frequencies using the derived estimates is very good as shown 
in Table II. The corresponding observed and expected distribution func- 
tions are in very close agreement with the maximum absolute difference 
of 0.0030 occurring at the class 3.5 - 4.0. Comparing this value with 
the Kolmogorov- Smirnov statistic, we see that 

D N = D 3 ° 00 = 0.0364 
a • oi 


gives an excellent M goodness-of-f it" at the 99 percent level of confi- 
dence. 


f 
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TABLE II 


OBSERVED AND EXPECTED FREQUENCIES FOR 2000 OBSERVATIONS 
FROM A MIXED WEIBULL DISTRIBUTION 


CUSSES 

f 0 


F es 

F o 

F e 

F es 

0 - 0.5 

215 

211.0 

143.3 

.1075 

.1055 

.0717 

0.5 - 1.0 

191 

192.9 

243.2 

.2030 

.2020 

.1932 

1.0 - 1.5 

217 

219.1 

271.9 

.3115 

.3116 

.3292 

1.5 - 2.0 

235 

237.8 

265.8 

.4290 

.4305 

.4621 

2.0 - 2.5 

236 

238.9 

239.9 

.5470 

.5499 

.5821 

2.5 - 3.0 

220 

222.1 

204.6 

.6570 

.6610 

.6843 

3.0 - 3.5 

190 

191.7 

166.8 

.7520 

.7569 

.7677 

3.5 - 4.0 

154 

154.3 

130.9 

.8290 

.8340 

.8332 

4.0 - 4.5 

U7 

116.2 

99.4 

.8875 

.8921 

.8828 

4.5 - 5.0 

84 

82.1 

73.3 

.9295 

.9332 

.9195 

5.0 - 5.5 

57 

54.6 

52.6 

.9580 

.9605 

.9458 

5.5 - 6.0 

36 

34.3 

36.8 

.9760 

.9776 

.9642 

6.0 - 6.5 

21 

20.4 

25.2 

.9865 

.9878 

.9768 

6.5 - 7.0 

12 

i 11.5 

16.9 

.9925 

.9935 

.9852 

7.0 - 7.5 

7 

j 6.2 

11.1 

.9960 

.9966 

.9908 

7.5 - 8.0 

4 

3.2 

7.1 

.9980 

.9982 

.9943 

8.0 - 8.5 

2 

1.6 

4.5 

.9990 

.9990 

.9966 

8.5 - 9.0 

1 

.8 

2.8 

.9995 

.9994 

.9980 

9.0 - 9.5 

1 

.4 

1.7 

1.0000 

.9997 

.9988 


Comparing F c with F es shows a maximum absolute dilference in the 
distribution functions of 0.0358 occurring at the class 0 - 0.5. Since 
the Kolmogorov- Smirnov statistic 

D N * D 2000 = 0.0304, 
a • 05 * 

this value of 0,0358 is sufficient to reject the hypothesis that our 
mixed distribution could be considered a simple Weibull distribution. 

As an alternate goodness-of-f it test for agreement between observed 
frequencies and expected frequencies using the derived estimates, the X 2 
index was calculated and the results are as follows: 
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X 2 = 1.2669 d.f. = 10 P(X 2 > 1.2669 * 0.995). 

Thus, in consideration of the low X 2 index of dispersion, we may conclude 
that we have an excellent fit for the chosen sample. 


V. CONCLUSIONS 


It is an accepted fact that the method of moments is not (except • 

for distributions such as the normal, binomial, and Poisson) the most 1 

efficient procedure for estimating the parameters of a frequency distri- f 

bution. Methods having maximum efficiency, such as the method of maxi- * 

mum likelihood are more desirable. However, in the case of the mixed { 

Weibull distribution with its five parameters, the maximum likelihood 1 

estimating equations are almost intractable. 

The central, noncentral, and factorial moments of this distribution 
were invest gated, and it was discovered that the noncentral moments 
possessed optimum characteristics for the development of estimating 
equations. A comparison of noncentral sample moments with the theo- 
retical moments for the sample selected showed an error of 0,09 percent 
for the first moment, 0.44 percent for the second moment, 1.64 percent 
for the third moment, 4.94 percent for the fourth moment, and 12.73 per- 
cent for the fifth moment. This progressive increasing percentage of 
nonagreement between sample moments and theoretical moments illustrates 
the large sampling errors involved in the use of higher order moments. 

Using only the first three sample moments with their relatively low per- 
centage of error in the estimating equations produced very good agree- 
ment between final estimates and the population parameters y lt y 2 , 0i 
and Qo. Sheppard's corrections for grouped data were applied to the 
sample moments in order to increase this agreement, but produced no 
significant change in the results; therefore, the corrections were not 
used in the estimating equations. 

This paper presents an estimating procedure that produced very good 
results for the sample chosen. The use of electronic digital computers 
makes the somewhat involved method practical and applicable to experi- 
ments in which the mixed Weibull distribution is the appropriate 
statistical model. 

In the author's opinion, the estimating procedures outlined in this 
report warrant further investigation for increase in efficiency, and 
improvement and simplification of form for the estimating equations 
involved. 
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APPENDIX 

COMPUTER PROGRAM FOR ESTIMATING THE PARAMETERS 
OF A MIXED WEIBULL DISTRIBUTION 


The author wishes to acknowledge the assistance of Mr. Raymond 
Smith of the Computation Laboratory, MSFC, for his work on the CDC 3200 
computer program. 
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COMPUTER PROGRAM FLOW CHART FOR ESTIMATING THE PARAMETERS OF A MIXED WEIBULL DISTRIBUTION 







LOAD DATA 


FIRST, TWO TABLES ARE READ IN STARTING IN XI AND FI 

DGI - A?! 

DG2 ~ A 72 
EG1 - TERMINAL 
EG2 - TERMINAL / 2 
BGAMl - BEGINNING 7i 
BGAM2 - BEGINNING J 2 

EN - SAMPLE SIZE 2000 USED FOR THIS PROGRAM 

ALP - CX .8 USED 

NVI - NUMBER OF VALUES IN THE TABLES 

K - AN INTEGER 1-4 

THE COMPUTED [i 5 SHOULD APPROACH m' 5 

THIS CAN BE ACCOMPLISHED BY A SERIES OF RUNS VARYING A y lt ty 2 , y lt AND / 


LEGEND 


rn'i - PM1 

Oi - TH1 

4 - PM2 

G 2 - TH2 

ntj - PM3 


m* - PM4 


4 


ml - PM5 




Pi - Bl 



fe - B2 
(3 j - B3 
\|fi - PSI1 
^ - PS I 2 
\|f 3 - PSI3 
fi5 - UP5 

N - EN SAMPLE SIZE 

n - EN ONCE THE ABOVE N IS USED THE LOCATION EN IS NO LONGER NEEDED. 
IT IS USED TO STORE n. 

v - V 

ENP - IN EQUATION 12, IF THE POSITIVE VALUE OF THE RADICAL IS USED. 

ENM - IF THE NEGATIVE VALUE IS USED. 

VP - IN EQUATION 13, IF THE POSITIVE VALUE OF THE RADICAL IS USED. 

VM - IF THE NEGATIVE VALUE IS USED 

THE FOLLOWING FOUR COMBINATIONS ARE POSSIBLE, DEPENDING UPON WHAT NUMBER 
IS READ INTO LOCATION K: 

v n k REGARDLESS OF WHICH COMBINATION IS USED, BEFORE THE 

+ + 1 PROGRAM IS CONTINUED, THE VALUE OF EQUATION 13 IS 

+ - 2 STORED IN V AND THE VALUE OF EQUATION 12 IS STORED 

+ 3 IN EN. 

- 4 
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